A survey on data‐efficient algorithms in big data era
نویسندگان
چکیده
Abstract The leading approaches in Machine Learning are notoriously data-hungry. Unfortunately, many application domains do not have access to big data because acquiring involves a process that is expensive or time-consuming. This has triggered serious debate both the industrial and academic communities calling for more data-efficient models harness power of artificial learners while achieving good results with less training particular human supervision. In light this debate, work investigates issue algorithms’ hungriness. First, it surveys from different perspectives. Then, presents comprehensive review existing methods systematizes them into four categories. Specifically, survey covers solution strategies handle data-efficiency by (i) using non-supervised algorithms are, nature, data-efficient, (ii) creating artificially data, (iii) transferring knowledge rich-data poor-data domains, (iv) altering data-hungry reduce their dependency upon amount samples, way they can perform well small samples regime. Each strategy extensively reviewed discussed. addition, emphasis put on how interplay each other order motivate exploration robust algorithms. Finally, delineates limitations, discusses research challenges, suggests future opportunities advance machine learning.
منابع مشابه
Data Stream Mining Algorithms in Big Data: A Survey
The infrastructure build in the big data platform is reliable to challenge the commercial and noncommercial IT development communities of data streams in high dimensional data cluster modeling. The APSO ie., Accelerated Particle Swarm Optimization is a technique which commonly known for data's are sourced to accumulate their continuation in the batch model induction algorithms which is not feas...
متن کاملThe Era of Big Spatial Data: A Survey
The recent explosion in the amount of spatial data calls for specialized systems to handle big spatial data. In this paper, we survey and contrast the existing work that has been done in the area of big spatial data. We categorize the existing work in this area from three different angles, namely, approach, architecture, and components. (1) The approaches used to implement spatial query process...
متن کاملA Survey of Bitmap Index-Compression Algorithms for Big Data
With the growing popularity of Internet applications and the widespread use of mobile Internet, Internet traffic has maintained rapid growth over the past two decades. Internet Traffic Archival Systems (ITAS) for packets or flow records have become more and more widely used in network monitoring, network troubleshooting, and user behavior and experience analysis. In this paper, we survey bitmap...
متن کاملBig Data: A Survey
In this paper, we review the background and state-of-the-art of big data. We first introduce the general background of big data and review related technologies, such as could computing, Internet of Things, data centers, and Hadoop. We then focus on the four phases of the value chain of big data, i.e., data generation, data acquisition, data storage, and data analysis. For each phase, we introdu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Big Data
سال: 2021
ISSN: ['2196-1115']
DOI: https://doi.org/10.1186/s40537-021-00419-9